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Abstract 


The coronavirus nucleocapsid (N) is a multifunctional phosphoprotein that encapsidates the genomic RNA into a helical nucleocapsid within the 
mature virion. The protein also plays roles in viral RNA transcription and/or replication and possibly viral mRNA translation. Phosphorylation is 
one of the most common post-translation modifications that plays important regulatory roles in modulating protein functions. It has been speculated 
for sometime that phosphorylation could play an important role in regulation of coronavirus N protein functions. As a first step toward positioning 
to address this we have identified the amino acids that are phosphorylated on the mouse hepatitis coronavirus (MHV) A59 N protein. High 
performance liquid chromatography coupled with electrospray ionization tandem mass spectrometry (HPLC-ESI-MS/MS) was used to identify 
phosphorylated sites on the N protein from both infected cells and purified extracellular virions. A total of six phosphorylated sites (S162, S170, 
T177, S389, S424 and T428) were identified on the protein from infected cells. The same six sites were also phosphorylated on the extracellular 


mature virion N protein. This is the first identification of phosphorylated sites for a group II coronavirus N protein. 


© 2007 Elsevier B.V. All rights reserved. 
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1. Introduction 


The Coronaviridae is a large family of medically important 
viruses that cause primarily respiratory and enteric infections in 
humans and a wide range of animals. The viruses are enveloped 
and contain a single-stranded positive-sense RNA genome that 
ranges in size from 27 to 31kb. All of the viruses have at 
least four structural proteins. The spike (S), membrane (M) 
and envelope (E) proteins are anchored in the virion envelope. 
The nucleocapsid (N) protein encapsidates the viral genome as 
a helical nucleocapsid inside the virion (Davies et al., 1981; 
Macnaughton et al., 1978). 

The focus of this report is the multifunctional N protein and 
its phosphorylation. Through its interactions with the viral RNA, 
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the M protein and itself, N plays important roles in virus assem- 
bly (Escors et al., 2001; Hurst et al., 2005; Narayanan et al., 2003; 
Narayanan and Makino, 2001; Verma et al., 2006). The protein 
is also involved in viral RNA transcription and/or replication 
(Baric et al., 1988; Chang and Brian, 1996; Compton et al., 1987; 
Denison et al., 1999; van der Meer et al., 1999). Recent stud- 
ies with coronavirus infectious clones provided direct evidence 
for a role of the N protein in replication and/or transcription 
(Almazan et al., 2004; Casais et al., 2001; Yount et al., 2000). 
The protein may also play a role in viral mRNA translation 
(Tahara et al., 1994). Additionally, MHV A59 and severe acute 
respiratory syndrome coronavirus (SARS-CoV) N proteins are 
type I interferon antagonists (Kopecky-Bromberg et al., 2007; 
Ye et al., 2007). 

All coronavirus N proteins are phosphorylated and highly 
basic with isoelectric points (pf) of 10.3 to 10.7 (Laude and 
Masters, 1995). The role of phosphorylation is not known 
and only very recently were phosphorylated sites identified 
for transmissible gastroenteritis virus (TGEV) and avian 
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infectious bronchitis virus (IBV) N proteins, group I and 
III viruses, respectively (Calvo et al., 2005; Chen et al., 
2005). 

In the study reported here we used high pressure liquid chro- 
matography coupled with electrospray ionization tandem mass 
spectrometry (HPLC-ESI-MS/MS) to identify the phosphory- 
lated sites on the MHV A59 N protein from both infected cells 
and purified extracellular virions. High accurate mass resolu- 
tion was achieved by ion cyclotron resonance (ICR) and Fourier 
transformation. Use of the ICR cell provides the most accurate 
mass data available today. A total of six phosphorylated sites 
(S162, S170, T177, S389, S424, and T428) were identified on 
the N protein taken from both infected cells and extracellular 
virions. This is the first identification of phosphorylated sites 
for a group II coronavirus N protein. 


2. Materials and methods 


2.1. Isolation of N protein from intracellular and 
extracellular fractions 


Mouse 17Cl1 cells were infected with 0.1 pfu/ml of MHV 
A59. Intracellular and extracellular fractions were harvested 
separately at 18h p.i. Cell culture supernatant containing extra- 
cellular virus was clarified by centrifugation to remove cell 
debris. Infected cells were rinsed with phosphate buffered saline 
chilled to 4°C and disrupted in ice cold lysis buffer (100 mM 
Tris-HCl [pH 7.5], 100 mM NaCl, 0.5% Triton X-100, 1 mM 
phenylmethylsulfony! fluoride, 1:100 dilution of phosphatase 
inhibitor cocktails 1 and 2 (Sigma), 10 wg RNase A). Nuclei 
were pelleted and the cytoplasmic lysates were resolved imme- 
diately by SDS-PAGE or stored at —80 °C. 

Virions were precipitated from the clarified extracellular 
supernatant by addition while mixing of polyethylene glycol 
(PEG) 8000 to a final concentration of ~12.5mM, followed 
by slow addition of NaCl to a final concentration of 400 mM. 
The precipitation was continued for 3h at 4°C with continuous 
stirring. Precipitates were collected by centrifugation at 4°C 
for 20 min at 10,000 x g. The precipitate was resuspended in 
TMEN (50mM Tris-HCl, 50mM maleic acid, 1mM EDTA, 
100mM NaCl) buffer [pH 6.0] and clarified at 10,000 x g for 
5 min. Clarified precipitated virus was purified in continuous 
20-60% (w/w) sucrose gradients that were centrifuged at 
90,000 x g for 4.5h at 4°C. Virions from pooled gradient 
fractions in the 1.16-1.20g/em? density range were subse- 
quently pelleted through a 30% sucrose cushion at 30,000 rpm 
in a Beckman SW 50.1 rotor. The purified virus was resus- 
pended in TMEN buffer [pH 6.0] and analyzed immediately 
or stored at —80°C. Virions were disrupted in ice cold lysis 
buffer as described above prior to resolution of proteins by 
SDS-PAGE. 


2.2. Isolation and preparation of the N protein for mass 
spectrometry 


Proteins from the intracellular and extracellular fractions 
were separated by SDS-PAGE in 8% gels and visualized by 


Coomassie staining. The N protein from both fractions was 
excised from the gel. Positions of the N protein were confirmed 
by Western blotting of proteins from parallel lanes. Gel pieces 
containing the N protein were destained twice with 300 wl of 
50% acetonitrile (ACN) in 40 mM NH4HCO;3 and dehydrated 
with 100% ACN for 15min. ACN was removed by aspiration 
and gel pieces were dried in a vacuum centrifuge at 60°C for 
20 min. Dried gel pieces were rehydrated in 20 wl of 40 mM 
NH4HCO3 containing 250 ng trypsin (Sigma) at 4°C for 15 min 
prior to the addition of 50 wl of 40 mM NH4HCO3 contain- 
ing 10fmol/wl angiotensin I. Trypsin digests were incubated 
overnight at 37°C. Protease activity was terminated by addi- 
tion of 10 wl of 5% formic acid (FA) and incubation at 37°C 
for 30 min. Digests were clarified for 1 min in a microcentrifuge 
and supernatants were removed. The extraction procedure was 
repeated using 40 pl of 5% FA. The resulting peptide mixtures 
were purified by solid-phase extraction using C18 ZipTips (Mil- 
lipore) after loading in 0.05% heptafluorobutyric acid: 5% FA 
(v/v) and elution with 50% ACN:1% FA (v/v). The samples were 
dried by vacuum centrifugation and dissolved in 4 wl of 0.1% 
FA:2%ACN (v/v). 


2.3. Mass spectrometry 


HPLC-ESI-MS/MS was performed on a Thermo Finnigan 
(San Jose, CA) LTQ-FTICR fitted with a PicoView™ nanospray 
source (New Objective, Woburn, MA). On-line HPLC was per- 
formed using a Michrom BioResources Paradigm MS4 micro 
two-dimensional HPLC (Alburn, CA) with a PicoFrit™ col- 
umn (New Objective, Woburn, MA, 75 wm i.d., packed with 
ProteoPep!™ II C18 material, 300 A); mobile phase, linear gra- 
dient of 2 to 27% ACN in 0.1% FA in 45 min, a hold of 5 min 
at 27% ACN, followed by a step to 50% ACN, a hold of 5 min 
and then a step to 80%; flow rate, 250 nl/min. 

A “top-10” data-dependent MS/MS analysis was performed 
(acquisition of a full scan spectrum followed by collision- 
induced dissociation (CID) mass spectra of the 10 most abundant 
ions in the survey scan) to identify N protein peptides. The sur- 
vey scan was acquired using the Fourier transform ion cyclotron 
resonance (FTICR) mass analyzer, which offers high mass 
accuracy and resolution. A list of potential phosphorylated 
peptides was generated based on detected serine/threonine- 
containing peptides from the N protein. For localization 
of phosphorylation sites a scan protocol of 1 survey scan 
(FTICR), followed by 7 targeted MS/MS scans (CID spectra 
of specified m/z values were acquired using the LTQ mass 
analyzer). All uninterpreted tandem MS data were searched 
using Mascot (Matrix Science, London, UK). Assignments of 
the phosphopeptides were confirmed by manual comparison 
of the tandem mass spectra with the predicted fragmenta- 
tion generated in silico by the MS-Product component of 
ProteinProspector (http://prospector.ucsf.edu). In the case of 
phosphopeptides containing more than one serine or threo- 
nine, we localized phosphorylation to particular residues by 
assigning fragment ion masses from the mass spectra that were 
unique to the fragmentation of a peptide phosphorylated at said 
residue. 
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Fig. 1. Representative SDS-PAGE used to isolate MHV N protein and peptide 
map coverage. (A) Proteins were visualized by Coomassie staining (left panel). 
Boxes indicate the areas of gel excision. The positions of N from purified virus 
(V) and MHV infected cells (I) are indicated to the right of both panels. Control 
uninfected cells (U) are also shown. Positions of molecular weight standards in 
kDa are shown on the left. Western blot analysis of parallel lanes was used to 
verify the identity of the N protein (right panel). A rabbit polyclonal N antibody 
was used to identify the N protein (Cologna et al., 2000). (B) Sequence coverage 
of peptides that identified phosphorylated resides is shown. Double lines indicate 
coverage for both intracellular and extracellular N protein fractions. Single lines 
identify additional coverage of the intracellular fraction. Dots mark predicted 
phosphorylation sites. 


3. Results 


3.1. MHV AS9N protein contains many potential 
phosphorylation sites 


The MHV A59 N protein consists of 454 amino acids. 
The protein contains 41 serine, 22 threonine and 11 tyrosine 
residues, approximately 9, 5 and 2.4%, respectively, of the total 
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amino acids. Of these, 24 serines, 5 threonines and | tyrosine 
are predicted by NetPhos 2.0 (http://www.cbs.dtu.dk/services/ 
NetPhos/) (Blom et al., 1999) to be potential phosphory- 
lation sites. To determine which of the predicted sites are 
indeed phosphorylated, the N proteins from both intracel- 
lular lysates and purified extracellular virions from MHV 
A59 infected mouse 17Cl1 cells were analyzed by HPLC- 
ESI/MS/MS. 


3.2. Identification of phosphorylated sites on extracellular 
virion N protein by HPLC-ESI-MS/MS 


To identify phosphorylated sites on the N protein in extra- 
cellular mature virions, virus was purified and proteins were 
separated by 1D SDS-PAGE and visualized by Coomassie stain- 
ing. The N protein band was excised (Fig. 1) and in-gel digested 
with trypsin. Peptide pools were extracted, desalted and ana- 
lyzed by on-line HPLC-ESI-MS/MS. 

Mass spectra were acquired using a Thermo Finnigan LTQ- 
FTICR. FTICR mass analyzer has the ability to obtain high mass 
accuracy and resolution while the LTQ mass analyzer can per- 
form tandem mass spectrometry (MS/MS and MS?) to obtain 
sequence information of peptides by collision-induced dissoci- 
ation (CID) of residues. It is one of the most sensitive instruments 
available for high mass resolution and mass accuracy for iden- 
tification of protein phosphorylation and complex proteomic 
studies. We obtained 54% sequence coverage (by MASCOT) 
of the virion N protein (Fig. 1B). 

The first phosphorylated site identified was serine 389 
(pS389). The identification was made from analysis of MS/MS 
spectra of the peptide N*82-3°4 (DGGADVVpSPKPQR) which 
exhibited the most intense peak at mass-to-charge ratio (m/z) of 
654.6, corresponding to [M+2H-98]** (Fig. 2). The mass is rep- 
resentative of a doubly charged ion with a decrease in mass of 98 
atomic mass units (amu), which is equal to the loss of one phos- 
phoric acid group (H3POx4) from the precursor ion m/z = 703.37". 
Since this peptide contained only one serine, MS? analysis was 
not necessary. Other intense peaks were assigned as b- and y- 
series sequence ions which result from the usual fragmentation 
of peptides at their amide bonds (Fig. 2). These results con- 
firmed preliminary results from earlier MALDI analysis in our 
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Fig. 2. CID MS/MS spectrum of phosphopeptide D3g2—R394 from tryptic digest of MHV virion N protein. Peaks correspond to fragments of the peptide containing 
residues 382-394 as indicated by the sequence shown with the spectrum. The most intense peak (m/z=654.6 [M+2H-98]°*) corresponds to a precursor ion 
(m/z=703.37*) with a+2 charge and minus H3PQ4. The position of phosphoserine at position 389 (pS) is indicated in the peptide sequence. The b-amino and 
y-carboxy series ions are indicated above peptide fragment peaks. Symbol (*) indicates loss of H3PO4 (98 amu) from a fragment. 
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Fig. 3. CID MS/MS and MS? spectra of phosphopeptide N4y2—R432 MHV virion N protein. Retention times (rt) for averaged spectra are indicated in each panel. (A) 
Precursor ion m/z= 698.3 was subjected to MS/MS. Peak m/z = 649.6 [M+2H-98]** corresponds to the precursor minus H3PO, and plus a+ 2 charge. Unique ions 
(arrows) m/z= 508.4 (yg2*) and 779.3 (bg) localized phosphorylation at S424. (B) MS? spectrum of the same peptide eluted at 28.70-28.93 min. Unique ion 859.5 y7 


confirmed phosphorylation at S424. (C) MS? spectrum of precursor ion m/z = 698 


32+ 


eluted at 25.49-25.64 min. A unique ion 699.3 be confirmed phosphorylation 


at T428. Symbol (§) indicates loss of a H2O molecule (18 amu). Symbol (*) indicates loss of H3PO4. 


lab which indicated that $389 was phosphorylated (White and 
Hogue, 2006). 

Two other phosphorylated amino acids, $424 and T428, 
were also identified in the carboxy end of the protein. Phos- 
phorylated S424 was detected in phosphopeptide, N*??~+3? 
(NVpSRELTPEDR). Precursor ion m/z=698.3 was subjected 
to both MS/MS and MS? fragmentation. The MS/MS spectrum 
exhibited the most intense peak at m/z 649.6, corresponding 
to [M+2H-98]** (Fig. 3A). Ions yg7* (m/z=508.4) and b¢ 
(m/z = 779.3) are unique ions which localized phosphorylation to 
serine 424, Observation of y7 (m/z = 859.5) in the MS? spectrum 
was also consistent with phosphorylation at $424 (Fig. 3B). 

Two HPLC-MS/MS peaks corresponding to phosphopeptide 
N*?2-432 were noted. Spectra generated from peptides that con- 
tained phosphorylated $424 had retention times between 28.67 
and 28.93 min (Fig. 3B), whereas the retention time for the pep- 
tides in the second peak was 25.49—25.64 min, roughly 3 min 


prior to the phosphorylated S424-containing peptides. Spectra 
of the latter identified T428 as phosphorylated. The fragment 
ion be (m/z= 699.3) in the MS? spectra was the unique identifier 
for phosphorylation at T428 (Fig. 3C). 

Additional sites in the amino terminal half of the virion N pro- 
tein were also identified. The precursor ion for peptide N!©7-!78 
(pSDIVERDPSSHEAIPTR) was m/z=995.0°*. A peak result- 
ing from the neutral loss of H3PO4 was observed as the most 
intense peak in the MS/MS spectra at m/z=946.27* (Fig. 4A). 
Observation of the appropriate yj9 ion at m/z= 1094.5 in the 
CID spectrum indicated that S162 was phosphorylated. Further 
analysis showed that T177 in the 162-178 peptide was also 
phosphorylated. Peptide N!°-!78 (DPpSSHEAIPTR) resulted 
from complete digestion of peptide 162-178 at arginine 167. 
The smaller peptide was fragmented and subjected to MS? anal- 
ysis which allowed us to identify $170 as also phosphorylated 
(Fig. 4B). 
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Fig. 4. CID spectra of phosphopeptides in the amino half of virion N protein. (A) MS? spectrum of phosphopeptide S162—R17g resulting from a missed cleavage at 
R167 is shown. Unique ion 1094.5 y19 indicates phosphorylation at S162. Three unique ions 815.4 b7, 1423.7 bj3, and 1536.7 by14 are all indicative of phosphorylation 
at T177. (B) MS? spectrum of the phosphopeptide consisting of residues 168-178 is shown with its signature ions indicated about the peaks. Unique ions (arrows) 


455.9 yg>* and 910.4 yg indicate phosphorylation at $170. 


3.3. Identification of N protein phosphorylation sites on 
MBAV N protein from infectect cells by HPLC-ESI-MS/MS 


To identify residues that are phosphorylated on the intracellu- 
lar pool of the N protein from infected cells, cytoplasmic lysates 
were resolved by SDS-PAGE. A sample of purified virions was 
run on the gel and used as a guide to excise the N protein band 
from the whole cell lysates (Fig. 2). Mass spectra were again 
acquired using HPLC-ESI-MS/MS. We obtained 69% sequence 
coverage (by MASCOT) of the intracellular N protein (Fig. 1B). 

In the intracellular N protein phosphopeptide N!©2-!78 was 
also detected. $162 and T177 were both phosphorylated. These 
phosphopeptides apparently coeluted and were subsequently 
fragmented together. Fig. SA is an example a MS? spectrum 
for N!©2-!78_ Detection of ions 797.5 y142* and 1094.5 yjo is 
characteristic of phosphorylation at S162, while bj2 and by4 
ions (m/z 1352.6 and 1536.7, respectively) are consistent with 
phosphorylation at residue T177. Phosphopeptide N!©2-!78 con- 
tained one site that was not cleaved by trypsin, but the fully 
cleaved phosphopeptide, N'°*-!78 (DPpSSHEAIPTR), was also 
detected, which allowed us to localize phosphorylation at $170 

as well (Fig. 5B and C). 

Further analysis of peptides identified phosphorylation at 
$389, S424, and T428, as was also determined for the extracel- 
lular virion protein (Fig. 6). Identification of S389 was straight 
forward due to $389 being the only possible target for phos- 
phorylation in peptide N*8?-3°4 (Fig. 6A). Phosphopeptides 
containing phosphorylated $424 and T428 coeluted in the intra- 
cellular samples, in contrast to the previous separation of these 
peptides for the extracellular samples. However, MS? fragmen- 


tation of the phosphopeptide indicated phosphorylation at both 
$424 and T428 (Fig. 6B and C). 


4. Discussion 


We used HPLC-ESI/MS/MS to identify the phosphorylated 
sites on the N protein of MHV A59. This is the first identifi- 
cation of amino acids that are phosphorylated on the N protein 
for a group II coronavirus. Phosphorylated sites were identi- 
fied on the protein from infected cells and mature extracellular 
virions. Six residues, $162, $170, T177, S389, S424 and T428, 
were found to be phosphorylated on both intracellular and extra- 
cellular virion N proteins (Fig. 7). Potential cellular kinases that 
may phosphorylate the identified sites were predicted using Net- 
PhosK 1.0 (http://www.cbs.dtu.dk/services/NetPhosK/) which 
produces neural network predictions of kinases specific for 
eukaryotic phosphorylation sites (Fig. 7) (Blom et al., 2004). 
It remains to be determined which are responsible for phospho- 
rylations of the N protein, but multiple potential kinases are 
predicted for all but one of the identified sites. 

Our results demonstrate that both serine and threonine 
residues are modified by phosphorylation, in contrast to ear- 
lier studies which suggested that MHV AS59, as well as closely 
related MHV JHM, N proteins are phosphorylated exclusively 
on serine residues (Siddell et al., 1981; Stohlman and Lai, 1979). 
The earlier conclusions were made based on phosphoamino 
acid analysis of [°?P]-labeled N peptides. Identification of the 
phosphorylated threonine residues in our study may reflect dif- 
ferences in the experimental approaches used to isolate and/or 

analyze the protein. Interestingly, based on comparative analysis 
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Fig. 5. CID fragmentation spectra of N phosphopeptides from the intracellular fraction of infected cells. (A) MS? spectrum of peptide S162—R17g. Unique ions 
(arrows) 797.5 yia*, and 1094.5 yio, as well as 1352.6 bj2, and 1536.7 by4 localized phosphorylation to $162 and T177, respectively. Other ions resulting from 
fragmentation are indicated. (s) and (t) indicates ions derived from either $162 and T177 phosphopeptides, respectively. (B) MS/MS spectrum of peptide N168-178. 
Neutral loss of H3PO4 from the precursor ion mass of 645.28 corresponds to the most intense peak at m/z = 596.5. Identification of unique ion (arrows) 455.9 yg +2 
localized phosphorylation to $170. (C) MS? spectrum of phosphopeptide 168-178. Identification of additional unique ions 264.1 b3-g§ and 910.6 yg confirmed the 
localization of phosphorylation at $170. Symbol (§) indicates masses corresponding to a loss of a H2O molecule. Symbol (*) indicates masses corresponding to a 


loss of H3PQq. 


of earlier HPLC of tryptic peptides derived from the N proteins 
of MHV A59 and a plaque variant of the JHM virus, it was sug- 
gested that serine at positions 161 and 162, respectively, were 
phosphorylated (Wilbur et al., 1986). Thus, S161 identified in 
the earlier study likely corresponds to $162 identified in our 
study. 

Alignment of the amino acid sequences for several members 
of the group II coronaviruses shows that, with the exception of 
$162, the sites are conserved in the bovine coronavirus (BCoV) 
and human coronavirus OC43 (HCoV OC43) N proteins (Fig. 8). 
While our work was in progress the phosphorylated sites on the 
N protein were identified for two other coronaviruses. Phospho- 
serines at positions 9, 156, 254 and 256 were identified for the 
group I TGEV protein from virus infected cells, whereas only 
sites S156 and $256 were identified on the N protein present in 
purified virions (Fig. 8) (Calvo et al., 2005). The sites that are 
phosphorylated on the group III IBV N protein expressed with 
baculovirus in insect cells were found to be identical to the ones 
on the N protein expressed alone in Vero cells (Chen et al., 2005). 


Data from the study suggests that $190, $192, T378 and S379 
are phosphorylated on intracellular IBV N (Fig. 8). Comparison 
of the sites for the three viruses illustrates that the location is not 
conserved, but phosphorylated sites are near either the amino 
or carboxy side of the serine/arginine (SR) rich domain that is 
conserved in all coronavirus N proteins. The SR rich domain is 
a distinguishing feature of the previously identified MHV RNA 
binding domain (Figs. 7 and 8). 

Recently structural information became available for IBV 
and SARS-CoV N proteins. The three-dimensional structures 
are based on NMR analysis of amino acids 45-181 (Huang et 
al., 2004) and X-ray crystal structures of the amino-terminal 
residues of IBV 19/29-160/162 (Fan et al., 2005; Jayaram et al., 
2006) and SARS-CoV 47-175 (Saikatendu et al., 2007). Crystal 
structures have also been determined for carboxy-terminal 
residues 219-349 of IBV N (Jayaram et al., 2006) and residues 
270-370 of SARS-CoV N (Yu et al., 2006). Based solely 
on amino acid alignments, this corresponds to the region 
encompassing roughly R45-N195 in the MHV A59 N protein 
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Fig. 6. CID induced MS/MS mass spectrum of peptides containing residues 382-394 and 422-432 from intracellular N. (A) MS/MS spectrum of residues 382-394. 
Loss of H3PO, from the precursor ion (m/z=703.37*) correlates with the most intense peak in the spectrum, m/z= 654.6 [M+2H-98]**. S389 is the only possible 
phosphorylation site in the peptide. (B) MS/MS spectrum of the peptide containing residues 422-432. Loss of H3POq from precursor (m/z=698.37*) correlates 
with the most intense peak in the spectrum, m/z= 649.6 [M+2H-98]?*. Unique ions 508.5 yg +2, 730.4 ye, and 779.3 be identify $424 as phosphorylated. (C) MS? 
fragmentation of the same peptide containing residues 422-432 allowed detection of unique ion 699.4 bg, indicative of phosphorylation at T428. Symbol (§) indicates 


loss of a H2O molecule. Symbol (*) indicates loss of H3PO4. 


RNA binding domain 


St62 Si0 Ti7 231 S3g0 S424 Ta28 
N[ Epp Pp p—lP Pp |coou 
1 139 163 283 «———» 380 406 454 
Putative dimerization domain 
Site Sequence Predicted Kinase* 
Sie TNTRpSDIVE CKII, CaM-II 
Si70 ERDPpSSHE PKA, GSK3, PKG 
Si71 ERDPSpSHE PKC, GSK3, CKI, PKC 
Ti EAIPpTRFAP PKC 
S3s9 ADVVpSPKPQ GSK3, cdk5 
Sag QRNVpSRELT PKA, CaM-II 
Typos SRELpTPEDR GSK3, cdk5, p33 MAPK 
* CKI, II — casine kinase I, Il, Ca2+/Calmodulin-dependent protein kinase II, 


PKC, A, G — protein king ”, A, G, GSK3 — glycogen synthetase kinase 3, 
cdk5 — cyclin dependent kinase 5, p33MAPK, mitrogen-activated protein kinase | 


Fig. 7. Schematic illustration of MHV N three-domain model separated by the 
A and B spacer domains (Parker and Masters, 1990). The relative positions of 
the phosphorylated sites identified on intracellular and extracellular virion N 
are shown. The positions of the RNA binding domain (Nelson et al., 2000) and 
putative dimerization domain (Yu et al., 2006) are noted. Eukaryotic kinases 
that may phosphorylate the identified sites were predicted by NetPhosK 1.0 
(http://www.cbs.dtu.dk/services/NetPhosK/) are shown in the table below the 
schematic. 


(Fig. 8). No structural information is yet available for MHV 
N, but based on the sequence alignments we can speculate 
about the potential positions of the phosphorylated sites on 
MHV N. The sites in the amino terminal region (S162, $170, 
T177) potentially fall within a large loop between B-strands 
(86 and B7 of SARS amino-terminus) (Saikatendu et al., 2007). 
These sites are adjacent to the previously identified MHV RNA 
binding domain that extends from T177 to P231 (Figs. 7 and 8) 
(Nelson et al., 2000). Localization of the phosphorylation sites 
near this domain could influence presentation of the RNA 
binding domain and in turn interaction with the RNA. Clearly, 
much remains to be understood about, not only the structure of 
the MHV N protein, but also its functional domains compared 
with other coronavirus nucleocapsids. RNA binding domains 
for IBV and SARS-CoV are located in the amino-terminal 
domain preceding the serine-rich domain (Fan et al., 2005; 
Huang et al., 2004; Saikatendu et al., 2007). Mutagenesis of 
a positively charged B-hairpin within the amino-terminus of 
IBV N identified residues involved in RNA-binding in vitro, 
and when introduced into an infectious clone virus replication 
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MHV -MSFVPGQENAGGRSSSVNRAGNGILKKTTWADQTERG PNNONRGRR-NOQPKQTATTQ-PNSGSVVPHYSWFSGITOQFQKGKEFQ 82 

BCV -MSFTPGKQSSS-RASFGNRSGNGILK---WADQSDQSRNVOTRGRR-AQPKOTATSQLPSGGNVVPYYSWFSGITQFQKGKEFE 79 

0c43 -MSFTPGKQSSS-RASSGNRSGNGILK---WADQSDQFRNVQTRGRR-AQPKOTATSQQPSGGNVVPYYSWFSGITQFQKGKEFE 79 

SARS ---MSDNGPQSNQRSAPRITFGG---------- PTDSTDNNONGGRNGARPKORRPQGLPNNT----- ASWFTALTQHGK-EELR 66 

TGEV MANQGQRVS§WGD-------------------------- ESTKTRGRSNSRGRKNNNI P---------- LSFFNPITLQQGSKFWN 49 

229E ----MATVKWAD-------------------------- ASEPQRGR------- QGRIP---------- YSLYSPLLVD-SEQPWK 37 

IBV ----MASGKAAG----------------------- KTDAPAPVIKLGGPKPPKVGSSG--------- NASWFQAIKAKKLNTPPP 49 

MHV FAEGQGVP IANGI PASEQKGYWY RHNRRSFKTPDGQOKQLLPRWY FYYLGTGPHAGASYGDSIEGVFWVANSQADTNTRSDIVER 167 
BCV FAEGQGVP IAPGVPATEAKGYWY RHNRRSFKTADGNORQLLPRWY FY Y LGTGPHAKDQYGTDIDGVFWVASNQADVNTPADILDR 164 
0c43 FAEGQGVP IAPGVPATEAKGYWY RHNRRSFKTADGNQRQLLPRWY FY Y LGTGPHAKDQYGTDIDGVYWVASNQADVNTPADIVDR 164 
SARS FPRGQGVPINTNSGPDDQIGYYRRATRR-VRGGDGKMKELSPRWY FY YLGTGPEASLPYGANKEGIVWVATEGALNTPKDHIGTR 150 
TGEV LCPRDFVPKGIG-NRDQQIGYWN--ROTRYRMVKGORKELPERWEFFY Y LGTGPHADAKFKDKLDGVVWVAKDGAMNKPTTLGSRG 131 
229E VIPRNLVPINKK-DKNKLIGYWN--VQKRFRTRKGKRVDLSPKLHFY YLGTGPHKDAKFRERVEGVVWVAVDGAKTEPTGYGVRR 119 
IBV KFEGSGVPDNENIKPSQQHGYWR--RQARFKPGKGGRKPVPDAWY FY YTGTGPAADLNWGDTQDGIVWVAAKGADTKSRSNQGTR 132 
MHV DPSSHEAI PT--RFAPGTVLPQGFY VEGSGRSAPA|SRSGSRSQSRIGP--NNRARSSSNQRQPASTVKPDMAEEIAALVLAKLGKD 248 
BCV DPSSDEAI PT--RFPPGTVLPQGYY IEGSGRSAPN|SRSTSRASSRIASSAGSRSRANSGNRTPTSGVTPDMADQIASLVLAKLGKD 247 
0c43 DPSSDEAI P2--RFPPGTVLPQGYY IEGSGRSAPN|SRSTSRTSSRIASSAGSRSRANSGNRTPTSGVTPDMADQIASLVLAKLGKD 247 
SARS NPNNNAATVL--QLPQGTTLPKGFYAEGS-RGGSQAS|SRSSSRSRGNSRNSTPGSSRGN-SPARMASGGGETALALLLLDRLNQL 231 
TGEV ANNESKALKFDGKVP-GEFQLEVNO|SRDNSRSRSQSRSRSRNRSQSRIGRQ---QFNNKKDDSVEQAVLAALKKLGVDTEKQQOR- 211 
229E KNSEPEI PHFNQKLPNGVTVVEEPD|SRAPSRSQSRSQSRIGRGESKPQSRNPSSDRNHNSQDDIMKAVAAALKSLGFDKPQEKDKK 205 
IBV DPDKFDQY PL--RFSDGGPDGNFRWDF'I PLNRGRSGRSTAAS SAAA[SRAPSREGSRIGRRSDSGDDLIARA AKIIQDQ------- 207 
MHV AGQP-------------------- KQVTKQSAKKVROKILNKPROKRT PNK--QCPVQQCFGKRGPNQ---NFGGSEMLKLGTSD 308 
BCV ATKP-------------------- QOVTKOTAKEIRQKILNKPROKRSPNK--QCTVQQCFGKRGPNQ---NFGGGEMLKLGTSD 307 
0c43 ATKP-------------------- QOVTKHTAKEVROKILNKPRQKRSPNK--QCTVQQCFGKRGPNQ---NFGGGEMLKLGTSD 307 
SARS ESKVS--------------- GKGQQQQGOTVTKKSAAEASKKPROKRTATK--QYNVTQAFGRRGPEQTOQGNFGDQDLIRQGTDY 299 
TGEV 0 -------------------- SRSKSKERSNSKTRDTTPKNENKHTWKRTAG---KGDVTRFYGARSSSA---NFGDTDLVANGSSA 271 
229E SAKTGT PKPSRNQSPASSQTSAKSLARSQSSETKEQKHEMOKPRWKROPNDDVTSNVTOQCFGPRDLDH---NFGSAGVVANGVEKA 286 
IBV 000 w----------------------- OQKKGSRITKAKADEMAHRRYCKRTIPP--NYRVDQVFGPRTKGK-EGNFGDDKMNEEGIKD 265 
MHV PQFPILAELAPTVGAFFFGSKLELVKKN--SGGADEPTKDVYELQY SGAVRFDSTLPGFETIMKVLNENLNAYQKDGGADVV§PK 391 
BCV PQFPI LAELAPTAGAFFFGSRLELAKVONLSGNLDEPQKDVYELRYNGAIRFDSTLSGFETIMKVLNENLNAYQQQDGMMNMSPK 392 
0c43 PQFPILAELAPTAGAFFFGSRLELAKVONLSGNPDEPOQKDVYELRYNGAIRFDSTLSGFETIMKVLNENLNAYQQQDGMMNMSPK 392 
SARS KHWPQIAQFAPSASAFFGMSRIGMEVTP----------- SGTWLTYHGAIKLDDKDPQFKDNVILLNKHIDAYKTFPPTEPKKDK 373 
TGEV KHY PQLAECVPSVSSILFGSYWTSKEDG----------- DQIEVTFTHKYHLPKDDP----KTGQFLQQINAYARPSEVAKEQRK 341 
229E KGY PQFAELVPSTAAMLFDSHIVSKESG----------- NTVVLTFTTRVTVPKDHP----HLGKFLEELNAFTR------ EMQQ 350 
IBV GRVTAMLNLVPSSHACLFGSRVTPKLQL----------- DGLHLRFEFTTVVPCDDPQFDNYVKICDQCVDGVGTRPKDDEPKPK 339 
MHV PQRKGRRQAQEKKDEVDNVSVAKPKSSVQRNVSRELEPEDRSLLAQI LDDGVVPDGLEDDSNV----------- 454 
BCV PQRQ--RGQKNGQGENDNI SVAAPKSRVQONKSRELTAEDI SLLKKMDEP----- YTEDTSEI----------- 448 
0c43 PQRQ--RGHKNGQGENDNI SVAVPKSRVQQNKSRELTAEDI SLLKKMDEP---~-- YTEDTSEI----------- 448 
SARS KKKT-DEAQPLPQROKKQPTVTLLPAADMDDFSRQLONSMSGASADSTQA------------------------- 422 
TGEV RKSR-------- SKSAERSEQDVVPDALIENYTDVFDDTQVEI I DEVIN-------------------------- 382 
229E HPLL-------- NPSALEFNPSQTSPATAEPVRDEVS-IETDIIDEVN--------------------------- 391 
IBV SRSS----- SRPATRGNSPAPRQQRPKKEKKLKKQDDEADKALTSDEERNNAQLEFY DEPKV INWGDAALGENEL 409 


Fig. 8. Alignment of representative coronavirus N protein amino acid sequences. Positions of the phosphorylated sites on MHV N, conserved resides in the group II 
viruses BCoV, HCoV OC43 and SARS CoV, and recently identified sites on IBV (Chen et al., 2005) and TGEV (Calvo et al., 2005) N proteins are highlighted. The 
conserved S/R rich domains are boxed in each sequence. Sequences for MHV A59 (P03416), BCoV (P10527), HCov OC43 (Q696P4), SARS CoV (P59595), TGEV 
(P04134), HCoV 229E (P15130) and IBV CoV (P69596) were obtained from Swiss-Prot. The alignment was generated using ClustalW (Thompson et al., 1994). 


was attenuated (Tan et al., 2006). Kinetic analysis of the 
amino-terminus of IBV N showed that the domain binds viral 
RNA, but the full length protein exhibits stronger binding, 
suggesting that interactions with other regions of the protein 
are also important (Spencer and Hiscox, 2006). The conserved 
SR domain has been proposed to link the amino-terminal RNA 
binding and carboxy-terminal multimerization domains of IBV 
and SAR-CoV (Fan et al., 2005; Saikatendu et al., 2007). All 
of the identified phosphorylation sites on the IBV N protein 
are located on the carboxy side of its SR domain. The presence 
of phosphorylated residues in close proximity to the MHV SR 
domain may uniquely influence how the surrounding domains 
interact with viral RNA. Modeling of other coronavirus N 
proteins based on the crystal forms of SARS-CoV and IBV 
amino-terminal domains indicate that the proteins are similar in 
the overall organization of B-strands, but since the proteins dif- 


fer in their surface charge distribution patterns, it was speculated 
that the residues involved in RNA binding and how they interact 
with the RNA is different (Saikatendu et al., 2007). In the case of 
MHV N, the presence of phosphorylated residues between the 
amino-end and the SR domain may contribute to this difference. 

Only phosphorylated $389 is included in what can be specu- 
lated to mirror on the IBV carboxy end structure. Interestingly, 
this would place the S389 site only a few residues beyond the 
carboxy terminal «5 helix in the IBV structure (Jayaram et al., 
2006). Phosphorylation at this site, as well as the downstream 
$424 and T428 sites, could be important for multimerization 
since they fall just beyond the region of MHV N that has been 
predicted to correspond to the carboxy terminal dimerization 
domain on the SARS-CoV N structure (Fig. 7) (Yu et al., 2006). 

Recent data suggest that IBV N protein binding to viral RNA 
is influenced by phosphorylation since nonphosphorylated N 
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protein bound both viral and nonviral RNAs, whereas the phos- 
phorylated N exhibited a higher binding affinity for viral RNAs 
(Chen et al., 2005). The results are consistent with earlier sug- 
gestions for a potential role of N phosphorylation (Laude and 
Masters, 1995; Nelson et al., 2000). Affinity for viral RNA 
could play a role in assembly or uncoating. Dephosphorylation 
of MHV N protein has been suggested to facilitate infection 
(Mohandas and Dales, 1991). Virion associated kinase activity 
has been reported for MHV JHM, but whether it plays any role 
during infection is not known (Siddell et al., 1981). Phosphory- 
lation of N could be important in whatever role the protein plays 
in viral RNA transcription and/or replication. Phosphorylation 
may alter the structure of the N protein and in turn presenta- 
tion of RNA binding domain(s) that is important for recognition 
of the packaging signal, transcription regulatory sequences or 
other signature sequences in the viral RNA(s). It was recently 
demonstrated that TGEV and SARS-CoV N proteins have RNA 
chaperone activity (Zuniga et al., 2007). We fully agree with the 
suggestion that phosphorylation may have a role in this activity. 

Samples analyzed in this study reflect steady state levels of 
phosphorylation for the N protein present in virus infected cells 
at 18h p.i. We did not detect any difference in phosphorylation 
of the protein from the infected cells and that in extracellu- 
lar virions. It should be noted that coronaviruses assemble at 
intracellular membranes in the region of the endoplasmic retic- 
ulum Golgi complex (ERGIC) (Krijnse-Locker et al., 1994). 
The details of virus release are not fully understood, but virions 
are thought to transport through the constitutive secretory path- 
way after budding. Thus, the intracellular fraction in our study 
no doubt contained some assembled virions that had not been 
secreted, which would contribute to the profile of sites that were 
identified. Our identification of the same sites on both intracel- 
lular and virion N differs from the results from recent analysis 
of TGEV N as discussed above. Four sites were found to be 
phosphorylated on the protein from infected cells, while only 
two of these were identified on extracellular virion N (Calvo 
et al., 2005). What accounts for the difference is not known. 
It may reflect a basic difference between MHV and TGEV. 
Calvo et al. (2005) suggested the possibility that phosphory- 
lation/dephosphorylation could play a role in TGEV assembly. 
We previously suggested that a more highly phosphorylated iso- 
form of the N protein exists in BCoV (Hogue, 1995) and MHV 
infected cells (Hogue, unpublished data). We also hypothesized 
that nucleocapsids might undergo dephosphorylation during 
assembly into virions. We did not attempt to identify phospho- 
rylation sites on different isoforms in the present study. Excised 
gel pieces included both forms of the protein that we previously 
suggested might be differentially phosphorylated based solely 
on phosphatase digestion results. Others have reported differ- 
ences in the phosphorylation status of intracellular and virions 
N proteins. IBV virion N protein was reported to be more phos- 
phorylated than the protein in virus infected cells (Jayaram et 
al., 2005). 

Our identification of phosphorylation sites for MHV N pro- 
tein is an important step toward deciphering the functional 
role(s) of the modification during the virus life cycle. Know- 
ing where phosphorylated sites are located on the protein will 


help direct future molecular studies to address how the modifica- 
tion contributes to or modulates functions of the multifunctional 
N protein. 
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